Assemblies were constructed from Helianthus EST sequences available in GenBank on 9 Sep 2007, plus additional ESTs from Steve Knapp's lab.  The assembly pipeline starts with SeqClean, which removes poor-quality sequences, clips poor-quality ends, and trims vector.  Prior to contig assembly, RBR, a library-less repeat masker, was run with default settings to dynamically identify and mask repeats.  Contigs were then constructed using TGICL with default settings, and a final file named *.unigenes, containing both contigs and singletons, was produced.
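
As a rough illustration of how the contents of a *.unigenes file could be tallied, the sketch below counts contigs and singletons from sequence headers. It rests on two assumptions that are not stated above: that the file is FASTA-formatted, and that contig identifiers contain the string "Contig" while singletons keep their original EST identifiers. The function name count_unigenes and the sample identifiers are hypothetical.

```python
def count_unigenes(lines):
    """Count contig and singleton records in FASTA-formatted lines.

    Assumption: contig IDs contain "Contig"; every other header is
    treated as a singleton. Adjust the test if the archive uses a
    different naming scheme.
    """
    contigs = 0
    singletons = 0
    for line in lines:
        if not line.startswith(">"):
            continue  # sequence data, not a header line
        fields = line[1:].split()
        seq_id = fields[0] if fields else ""
        if "Contig" in seq_id:
            contigs += 1
        else:
            singletons += 1
    return contigs, singletons


# Small in-memory example with made-up identifiers.
demo = [
    ">CL1Contig1",
    "ACGTACGT",
    ">CL1Contig2",
    "GGCCTTAA",
    ">EST000123",
    "TTGACCA",
]
print(count_unigenes(demo))
```

To run it on a real file, pass an open file handle in place of the list, since the function only needs an iterable of lines.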


A total of 33,376 unigenes were assembled from a starting pool of 94,706 ESTs for Helianthus annuus.

A total of 87,237 unigenes were assembled from a starting pool of 284,340 ESTs for all species of Helianthus.
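
The two totals above can be turned into a simple redundancy figure, the mean number of starting ESTs per unigene. The short sketch below does that arithmetic using only the counts reported here; the helper name ests_per_unigene is illustrative. Because the starting pools may include sequences later discarded during cleaning, the ratios are best read as upper bounds on average cluster depth.

```python
def ests_per_unigene(n_ests, n_unigenes):
    """Mean number of starting ESTs per assembled unigene."""
    return n_ests / n_unigenes


# (starting ESTs, unigenes) as reported for each assembly.
assemblies = {
    "Helianthus annuus": (94706, 33376),
    "all Helianthus species": (284340, 87237),
}

for name, (n_ests, n_unigenes) in assemblies.items():
    ratio = ests_per_unigene(n_ests, n_unigenes)
    print(f"{name}: {ratio:.2f} ESTs per unigene")
```

This gives roughly 2.8 ESTs per unigene for H. annuus and roughly 3.3 for the combined Helianthus set.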


All files produced during assembly are included in the corresponding assembly archive.
